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Abstract 

Imagine a sequence in which the first letter comes from a binary alphabet, the 
second letter can be chosen on an alphabet with 10 elements, the third letter can 
be chosen on an alphabet with 3 elements and so on. When such a sequence can 
be called random? In this paper we offer a solution to the above question using the 
approach to randomness proposed by Algorithmic Information Theory. 



1 Varying Alphabets and the Cantor Expansion 



Algorithmic Information Theory (see [H Em ) deals with random sequences over a finite 
(not necessarily binary) alphabet. A real number is random if its binary expansion is a 
binary random sequence; the choice of base is irrelevant (see ^ for various proofs). 

Instead of working with a fixed alphabet we can imagine that the letters of a sequence are 
taken from a fixed sequence of alphabets. This construction was introduced by Cantor 
as a generalization of the b-aiy expansion of reals. More precisely, let 



bi,b2, 



be a fixed infinite sequence of positive integers greater than 1. Using a point we form 
tlie finite or infinite sequence 

O.X1X2... (1) 

such that < < 6„ — 1, for all n > 1. Consider the set of rationals 

Xl X\ X2 Xfi . , 

•^1 = 1~'^2 = 7- + 171-' ■ ■ ■ 'Sn = Sn-1 + TTT J~' i^) 

bi bi 6162 O1O2 ■■■bn 

The above sum is bounded from above by 1, 

bj-l 

6162 ...bi ~ 6162 ...bn 



i=l 



SO there is a unique real number a that is the least upper bound of all partial sums 
The sequence is called the Cantor expansion of the real a G [0, 1]. 

If Xn = bn — 1, for all n > 1, then s„ = 1 — 1/(6162 . . . bn), so a = 1. If 6,„ = 6, for all 
n > 1, then the Cantor expansion becomes the classical 6-ary expansion. If a;„ = 1 and 
bn = n + 1, for all n > 1, then a = e. 

The genuine strength of the Cantor expansion unfolds when various choices and interac- 
tions on different scales are considered. 

The main result regarding Cantor expansions is: 



Theorem 1 Fix an infinite sequence of scales 61, 62) • • •• Assume that we exclude Cantor 
expansions in which starting from some place after the point all the consecutive digits are 
Xn = bn — ^. Then, every real number a E [0, 1] has a unique Cantor expansion (relative 
to 61, 62! • • J cmd its digits are determined by the following relations: 

Pl = a,Xi = \ bipi\,pn+l = bnPn " Xn,Xn+l = [6.„+ip„+iJ. 



Consequently, if we exclude Cantor expansions in which starting from some place after 
the point all the consecutive digits are x„ = 6„ — 1. then given a £ [0, 1] there is a 
unique sequence x" € X^^^ whose Cantor expansion is exactly a. If x € X*^-'^^ then we 
denote by the real whose Cantor digits are given by the sequence x, hence x"'' = x 
and a^°' = a. 

For more details regarding the Cantor expansion see 1^ . 



2 Examples 



First, following 14^ we consider the British system in which length can be measured 
in miles, furlongs, chains, yards, feet, hands, inches, lines. These scales relate in the 
following way: 1 mile = 8 furlongs = 8-10 chains = 8 • 10 • 22 yards = 8 • 10 • 22 • 3 feet = 
8-10-22-3-4 hands = 8-10-22-3-4-3 inches = 8- 10-22-3-4-3- 12 lines. Hence, the sequence 
of scales starts with 61 = 10, 62 = 8, 63 = 10, 64 = 22, 65 = 3, 65 = 4, 67 = 3,bs = 12 and 
can be continued ad infinitum. For example, the number 0.963(11)232(10)00 ••• •• • 
represents a length of 9 miles, 6 furlongs, 3 chains, 11 yards, 2 feet, 3 hands, 2 inches 
and 10 lines. 

For our second example we consider a ball in gravitational fall impinging onto a board 
of nails with different numbers 6,1 + 1 of nails at different horizontal levels (here, n 



stands for the nth horizontal level and 5„ is the basis corresponding to the position n). 
Let us assume that the layers are "sufficiently far apart" (and that there are periodic 
boundary conditions realizable by elastic mirrors). Then, depending on which one of 
the bn openings the ball takes, one identifies the associated number (counted from to 
bn — 1) with the nth position x„ € {0, . . . ,bn — 1} after the point. The resulting sequence 
leads to the real number whose Cantor expansion is 0. 2:1X2 • • • x„ • • •. 

As a third example we consider a quantum correspondent to the board of nails harness- 
ing irreducible complementarity and the randomness in the outcome of measurements on 
single particles. Take a quantized system with at least two complementary observables 
A, B, each one associated with different outcomes ai,bj, i,j G {0, . . . , — 1}, respec- 
tively. Notice that, in principle, N could be a large (but finite) number. Suppose further 
that A, B are "maximally" complementary in the sense that measurement of A totally 
randomizes the outcome of B and vice versa (this should not be confused with optimal 
mutually unbiased measurements |inj). 

A real number O.X1X2 • • • x„ • ■ ■ in the Cantor expansion can be constructed from suc- 
cessive measurements of A and B as follows. Since all bases 6„ used for the Cantor 
expansion are assumed to be bounded, choose A^ to be the least common multiple of all 
bases bn. Then partition the A^ outcomes into even partitions, one per different base, 
containing as many elements as are required for associating different elements of the nth 
partition with numbers from the set {0, . . . , 6„ — 1}. Then, by measuring 



successively, the nth position Xn G {0, . . . , 6„ — 1} can be identified with the number 
associated with the element of the partition which contains the measurement outcome. 

As an example, consider the Cantor expansion of a number in the bases 2, 6, and 9. As 
the least common multiple is 18, we choose two observables with 18 different outcomes; 
e.g., angular momentum components in two perpendicular directions of a particle of total 
angular momentum |/i with outcomes in (units are in h) 



Associate with the outcomes the set {0, 1,2,... , 17} and form the even partitions 



(or any partition obtained by permutating the elements of {0, 1,2,..., 17}) associated 
with the bases 2, 6, and 9, respectively. 

Then, upon successive measurements of angular momentum components in the two per- 
pendicular directions, the outcomes are translated into random digits in the bases 2, 6, 
and 9, accordingly. 

As the above quantum example may appear "cooked up", since the coding is based 
on a uniform radix N expansion, one might consider successive measurements of the 
location and the velocity of a single particle. In such a case, the value x„ is obtained 
by associating with it the click in a particular detector (or a range thereof) associated 
with spatial or momentum measurements. Any such arrangements are not very different 
in principle, since every measurement of a quantized system corresponds to registering a 
discrete event associated with a detector click . 



A,B,A,B,A,B,... 




{{0, 1, 2, 3, 4, 5, 6, 7, 8}, {9, 10, 11, 12, 13, 14, 15, 16, 17}}, 
{{0, 1, 2}, {3, 4, 5}, {6, 7, 8}, {9, 10, 11}, {12, 13, 14}, {15, 16, 17}}, 
{{0, 1}, {2, 3}, {4, 5}, {6, 7}, {8, 9}, {10, 11}, {12, 13}, {14, 15}, {16, 17}} 



3 Notation and Basic Results 



We consider IN to be the set of non-negative integers. The cardinahty of the set A is 
denoted by card (^4). The base 2 logarithm is denoted by log. 

If X is a set, then X* denotes the free monoid (under concatenation) generated by X with 
e standing for the empty string. The length of a string w G X* is denoted by \w\. We 
consider the space X'^ of infinite sequences (w-words) over X. If x = xiX2 . . . Xn . . . G X^ , 
then x(n) = X1X2 ■ ■ - Xn is the prefix of length n of x. Strings and sequences will be 
denoted respectively by x,u,v,v,w,... and x, y, .... For w,v & X* and x G X^ let 
wv,wx be the concatenation between w and v,x, respectively. 

By "C" we denote the prefix relation between strings: w Q v there is a v' such that 
wv' = V. The relation "C" is similarly defined for w G X* and x G X^: IZ x if 
there is a sequence x' such that wx' = x. The sets pref(x) = {w : w G X*,w C x} 
and pref{B) = IJ^g^pref(x) are the languages of prefixes of x G X^ and B C X^ , 
respectively. Finally, wX'^ = {x G X^ : w G pref(x)}. The sets {wX'^)^^x* define the 
natural topology on X^ . 

Assume now that X is finite and has r elements. The unbiased discrete measure on X 
is the probabilistic measure h{A) = card(j4)/r, for every subset of X. It induces the 
product measure /i defined on all Borel subsets of X'^. This measure coincides with the 
Lebesgue measure on the unit interval, it is computable and iJ,{wX^) = r~'"'l, for every 
w G X*. For more details see jSHim. 

In dealing with Cantor expansions we assume that the sequence of bases bi,b2, ■ ■ - bn, ■ ■ ■ 
is computable, i.e. given by a computable function / : IN ^ IN \ {0, 1}. Let Xi = 
{0, . . . , /(i) — 1}, for i > 2, and define the space 

00 

X^f^ =YIX,C]N'^. 

i=l 

The set 

prei(X^^^) = {w : w = W1W2 . . .Wn,Wi € Xi,l < i < n} 
plays for X^-^^ the role played by X* for X"^. 

Prefixes of a sequence x G X^^^ are defined in a natural way and the set of all (admissible) 
prefixes will be denoted by pref(x). As we will report any coding to binary, the length 
of w = W1W2 . . .Wn & pref (X^-'^)) is || w ||= log(niLi /(O)! 1^1 = ^- ^^^^ ^^e topology 
is induced by the sets [w]f = {x G X^ : w G pref(x)} and the corresponding measure is 
defined by 

\w\ 

j=i 

for every w G pref(X'''). An open set is of the form [A]f = {x : 3n(x(n) G A)}, for some 
set A C pref(X(/)). The open set [A]f is computably enumerable if A is computably 
enumerable. Only the equivalence between the notions of Cantor-randomness and weakly 
Chaitin-Cantor-randomness will be proven 

The following two lemmas will be useful: 

Lemma 2 Let < a < 2"^ and let a, (3 he two reals in the interval [a-2~"^, (a + l)- 2~™]. 
Then, the first m bits of a and (3 coincide, i.e., if a = X^^i 2;i2~' and (3 = X^^iyj2^% 
then Xi = Ui, for all i = 1,2, ... ,m. 



Lemma 3 Let 61, 62, • • • be an infinite sequence of scales and a = j / (6162 • • • &m) G [0, !]• 
Let a, (3 be two reals in the interval [a, a + 1/(6162 ... 6m)]- Then, the first m dig- 
its of the Cantor expansions (relative to 61,62,...^ of a and (5 coincide, i.e., if a = 
Si^i Xi/{bib2 ...bi) and /? = Yl'^i Vi/ibib^ ■ ■ ■ bi), then Xj = yi, for all i = 1,2, ... ,m. 

4 Definitions of a Random Sequence Relative to the Cantor 
Expansion 

In this section we propose five definitions for random sequences relative to their Cantor 
expansions and we prove that all definitions are mutually equivalent. We will fix a 
computable sequence of scales /. 

Wc say that the sequence x G X-^ is Cantor-random if the real number is random (in 
the sense of Algorithmic Information Theory), e.g., the sequence corresponding to the 
binary expansion of a is random. 

Next we define the notion of weakly Chaitin-Cantor random sequence. To this aim we 
introduce the Cantor self-delimiting Turing machine (shortly, a machine), which is a 
Turing machine C processing binary strings and producing elements of pref (X*^-'^^) such 
that its program set (domain) PROGc = {x G {0,1}* : C(x) halts} is a prefix-free set 
of strings. Sometimes we will write C{x) < 00 when C halts on x and C(x) = 00 in the 
opposite case. 

The program-size complexity of the string w G pref(X(-'')) (relative to C) is defined by 
Hc{w) = min{|t;| : v G S*, C{y) = w}, where min0 = 00. As in the classical situation 
the set of Cantor self-delimiting Turing machines is computably enumerable, so we can 
effectively construct a machine U (called universal) such that for every machine C, 
Hu{x) < Hc{x) -\- 0(1). In what follows we will fix a universal machine U and denote 
Hu simply by H. 

The sequence x G is weakly Chaitin- Cantor-random if there exists a positive constant 
c such that for all n G IN, iJ(x(n)) >|| a; || — c. 

The sequence x G is strongly Chaitin- Cantor-random if the following relation holds 
true: lim„_»oo(-f^(x(n))— || x ||) = 00. 

The sequence x G is Martin- Lof- Cantor-random if for every computably enumerable 
collection of computably enumerable open sets {On) in X^-l'^ such that for every n G IM, 
/x(On) < 2-" we have x ^ n^^^On. 

The sequence x G X^ is Solovay- Cantor-random if for every computably enumerable 
collection of computably enumerable open sets {On) in such that J2n=i /^(^n) < 00 
the relation x G On is true only for finitely many n G IN. 

Theorem 4 Let x G X^'f^ Then, the following statements are equivalent: 

1. The sequence x is weakly Chaitin- Cantor-random. 

2. The sequence x is strongly Chaitin- Cantor-random. 

3. The sequence x is Martin-Lof- Cantor-random. 

4. The sequence x is Solovay-Cantor-random. 



These equivalences are direct translations of the classical proofs (see, for example, ^). 
Moreover, we have the following relations. 



Theorem 5 Lei x G X^^h Then, the sequence x is weakly Chaitin- Cantor-random if 
X is Cantor-random. If the function f is bounded, then every weakly Chaitin- Cantor- 
random X is also Cantor-random sequence. 

Proof. The argument is modification of the proof idea of Theorem 3 in [U] . 

Assume first that x G X^^^ is not Cantor-random and let a = a^. Let y = yiy2 ■ ■ ■ 
be the bits of the binary expansion of a. We shall show that y is not a binary random 
sequence. 

Fix an integer m > 1 and consider the rational 

m 

/ \ X *■ Xi 

a 



. bib2...bm 

1=1 

We note that w = xiX2 . ■ . Xm is in pref (X^-^^) and || w |[= log(6i62 . . . 6m)- Further on, 
< a[m) < a and 

oo oo ; ^ ^ 

a — a{m) < > < > = 

Next we define the following parameters: 

Mm=[log{bib2...bm)\, (3) 

am= Km).2^^™J. (4) 

and we note that 

a-a(m) < , , ^ , < 2'^"^. (5) 
6162 ...bm 

We are now in a position to prove the relation: for every integer m > 1, 

Hm), a] C [am • 2"*^'", (a^ + 2) • 2^^^™) . (6) 
Indeed, in view of © and Q we have a < (cm + 2) • 2"^^™ as: 

a ■ 2"^'^'" < a{m) ■ 2"*^"^ + 1 < + 2. 

Again from (HJ, Cm < a{m) ■ 2*^™. 

Using ®, from w = X1X2 • . . Xm plus two more bits we can determine yiy2 ■ ■ ■ UMmi that 
is, from the first m digits of the Cantor expansion of a and two additional bits we can 
compute the first Mm binary digits of a. In view of Lemma [21 we obtain a computable 
function h which on an input consisting of a binary string v of length 2 and w produces 
as output y(Mm). 

We are ready to use the assumption that y is random but x is not Cantor-random, that 
is, there is a universal self-delimiting Turing machine U"^ working on binary strings and 
there is a positive constant c such that for all n > 1, 



Hu^{y(.n)) >n-c, 



(7) 



and for every positive d there exists a positive integer Id (depending upon d) such that 

H{^{k))<\\^ild)\\-d. (8) 

We construct a binary self-dehmiting Turing machine such that for every d > 0, 
there exist two strings and v,si^ G {0,1}*, such that \v\ = 2, < || x{ld) \\ —d = 
log(6i62 • . . bi^) - d and C^{v, = y(M,J. 

Consequently, in view of (O and (jSJ, for every d we have: 

Mi^-c < Hu2iy{MiJ) 

< Hc2iy{MiJ) + 0il) 

< \siJ + 2 + 0il) 

< log(6i62 . . . 6i J + 0(1) 
= Mi^ + 0(1) - d, 



a contradiction. 

Recall that a = ^«/(^i^2 • • • ^j) = YliZi Vi'^'^ ■ Now we prove that x is Cantor- 

random whenever y is random. Let m > 1 be an integer and let a2{Tn) = X^ilLi^i^ *• 
Given a large enough m we effectively compute the integer to be the maximum integer 
L > 1 such that 



< 



1 



6162 ■■■bi 

We continue by proving that for all large enough m > 1: 

1 



[a2{m),a] C 
We note that a2{'m) < a and 



a{tr. 



b,b 



102 



6162 



(9) 



(10) 



Xi 



i=l 



bib2 ...bi 



< a 



Xi 



i=ti 



6162 ...bi 



b' 1 

< a(tm)+V , , < a(t„)+ 

^ O1O2 . . . Oj 



1 



bib2 ...bt„ 



As a < a{tm) + l/(^ife2 • • • &tm) we only need to show that a{tm) < a2("T') + 
1/(6162 •• • bt^). This is the case as otherwise, by ©, we would have: 



a{tm) > a2{m) + 



1 



6162 ...bt. 



> a2(m) + 2-"" > a, 



a contradiction. 



In case when / is bounded, assume by contradiction that x is Cantor-random but y is 
not random, that is there exists a positive constant c such that for all n > 1 we have: 



//(x(n)) >log(6i62...6„)-c, 
and for every d > there exists an integer > such that 

Hij2{y{nd)) <nd- d. 



(11) 



(12) 



In view of Lemma |21 and ()1U() there is a computable function F depending upon two 
binary strings such that \v\ = 2, F{y{nd),v) = x(i„^), so the partially computable 



function F o U'^ which maps binary strings in elements of pref(X(''^^) is a Cantor self- 
deUmiting Tm'ing machine such that for every d > there exists a binary string Sn^ of 
length less than rid — d and a binary string v of length 2 such that F{U'^{sn^),v) = x(t„^). 

As / is bounded, the difference | tm+i — I is bounded. In view of @, for large m > 1, 
6162 . . . bt^ > m — 1, so we can write: 

Ud-c-l < log(6i62 • • • bt„^ ) - c 

< Hp,u<x{tn,)) + 0(1) 

< |s„J + 2 + 0(1) 

< nd-d + 0{l), 

a contradiction. q.e.d. 



Open Question 6 It is an open question whether the above result holds true for un- 
bounded functions f. 



Consider the following statement: 



Let X be a binary sequence. If there exists a computable infinite set M of positive 
integers and c > such that for every m G M, Hu2(x{m)) > m — c, then x is random. 



Note that if the above statement would be true, then the answer to the Open Question 
would be affirmative. 

It is interesting to note that in case of unbounded functions / we may have Cantor- 
random sequences x G X^^^ which do not contain a certain letter, e.g. G Xj. 



Example 7 Let f{i) = 2*+^. Then the measure of the set F = Y^S^i^'i, where X[ = 
Xi \ {0} satisfies fJ-{F) = Yli^ii^ ~ 2^*^^) > 0. Thus F contains a Cantor-random 
sequence x. 

However, by construction, x does not contain the letter which is in every Xi. 



5 On the Meaning of Randomness in Cantor's Setting 

So far, a great number of investigations have concentrated on the meaning and definition 
of randomness in the standard context, in which bases remain the same at all scales. 
That is, if one for instance "zooms into" a number by considering the next place in its 
expansion, it is always taken for granted that the same base is associated with different 
places. 

From a physical viewpoint, if one looks into a physical property encoded into a real in, 
say, fixed decimal notation, then by taking the next digit amounts to specifying that 
physical property more precisely by a factor of ten. A fixed "zoom" factor may be 
the right choice if all physical properties such as forces and symmetries and boundary 
conditions remain the same at all scales. But this is hardly to be expected. Take, for 
instance, a "fractal" coastline. How is it generated? The origins of its geometry are 
the forces of the tidal and other forces on the land and coastal soil. That is, water 



moving back and forth, forming eddies, washing out httle bays, and httle bays within 
Httle bays, and Httle bays within httle bays within little bays, . . . and so on. There may 
be some structural components of this flow which results in scale dependence. Maybe 
the soil-water system forming the landscape will be "softer" at smaller scales, making 
bays relatively larger that their macroscopic counterparts. Indeed, eventually, at least 
at subatomic scales, the formation of currents and eddies responsible for the creation of 
ever smaller bays will break down. 

In such cases, the base of the expansion might have to be modified in order to be able 
to maintain a proper relation between the coding of the geometric object formed by the 
physical system and the meaning of its number representation in terms of "zooming". 
All such processes are naturally stochastic, and therefore deserve a proper and precise 
formalization in terms of random sequences in Cantor representations. 
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